INTRODUCTION To EXCEL J 


A AfterWork = 


BME LEARNING OUTCOMES 


e l|can perform data cleaning techniques to prepare data for further 


analysis. 
| can perform data analysis techniques to answer business problems. 


e 
e |can perform data visualisation techniques to communicate insights 
to stakeholders. 
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BMEa INTRODUCTION 


Spreadsheets 
e Most familiar use is data presentation. 


e Formulas are useful for computing sales, budgets, and other numeric 
summaries. 


e Convenient for analysis of structured data and data visualisation. 


Interesting facts 


e A large fraction of the world's structured data is managed and 
manipulated in spreadsheets. 


e Microsoft Excel is dominant tool; many features, but proprietary and 
expensive. 


e Google Sheets is open and free; fewer features. 
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BMEa PRACTICE: DATA CLEANING 


Concepts 
e Data Cleaning Techniques 


Exercise 
e Perform the data cleaning techniques outlined in the given guidelines 
document and save the file for further analysis. 


e Data Cleaning guidelines: https://bit.ly/DataCleaningGuidelines 


Dataset 
e https://bit.ly/DirtyTitanicDataset 
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BMEa PRACTICE: IMPORTING & EXPORTING 


Concepts 
e Importing data 
e Exporting data 


Exercise 
e Add new column to the left of column F called Fahrenheit. 
e Use formula to compute values from celcius column E. 
o Note: Fahrenheit =1.8C + 32 
e Export the data as a CSV file. 


Dataset 
e https://bit.ly/CitiesDatasetl 
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BMEaE ICE BREAKER 


Come up with as many questions as possible that can be 
answered from the clean titanic dataset. 
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BEME PRACTICE: DATA ANALYSIS 


Concepts 
e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / 
aggregation / analysis 
Exercise 
e How many cities in Italy? 
e What is the average latitude: 


o Overall? 
o For cities with temperature < 10 ? 
o For cities with temperature > 1O ? 
o For cities where both the city name and the country name end in the 
letter "a" ? 
o Which are warmer on average - cities in the EU or cities not in the EU? 
Dataset 


e https://bit.ly/CitiesDatasetl and https://bit.ly/CountriesDatasetl 
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BMEea PRACTICE: DATA ANALYSIS (TITANIC) 


Concepts 

e Sorting, Filtering, Aggregation, Joining, Pivot Tables: Restructuring / 

Aggregation / Analysis. 

Exercise 

e How many passengers sailed for free? 
How many married women over age 50 embarked in Cherbourg? 
How many passenger ages are missing? 
What is the average fare paid by these passengers? 
What is the most common last name among passengers, and how 
many passengers have that last name? What is the average number 
of passengers per last name? 
What is the average fare paid by passengers in the three classes? 

e What is the average age of passengers in the three classes? 
Dataset 

e Clean Titanic Dataset 
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BEMEa PRACTICE: DATA VISUALISATION 


e Effective Data visualisation 
o Hans Rosling's 200 Countries, 200 Years, 4 Minutes: 
m https:/www.youtube.com/watch?v=jbkSRLYSojo 
Suggested charts are often good ones. 
For help while working with charts i.e. Web search or fiddling 
Don't underestimate the power of basic visualizations 
Bar charts, Pie charts, Scatterplots and Sparklines 
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Ma PRACTICE: BAR CHARTS 


e Concept 
o Bar charts: When one axis is categories and the other is numeric. 


e Exercise 
o Ten countries with the highest population, bar chart showing 


populations. 


e Dataset 
o https:///bit.ly/CountriesDatasetl 
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Ra PRACTICE: PIE CHARTS 


e Concept 
o Pie charts: To compare relative sizes of categories. 


e Exercise 
o Pie chart showing relative number of cities with negative longitude 


and positive longitude. 


e Dataset 
o https///bit.ly/CitiesDatasetl 
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BMEa PRACTICE: SCATTER PLOTS 


e Concept 
o Scatter plots: When both axes are numeric. 


e Exercise 
o Use a scatter plot to explore whether there is any relationship 
between the latitude of cities in a country (x-axis) and the population 


of that country (y-axis). 


e Dataset 
o https://bit.ly/CitiesDatasetl and https://bit.ly/CountriesDatasetl 
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E PRACTICE: DATA VISUALISATION (TITANIC) 


Concepts 


Data visualisation Techniques 


Exercise 


How many passengers sailed for free? 


e How many married women over age 50 embarked in Cherbourg? 
e How many passenger ages are missing? 
e What is the average fare paid by these passengers? 
e What is the most common last name among passengers, and how 
e many passengers have that last name? What is the average number 
e of passengers per last name? 
e What is the average fare paid by passengers in the three classes? 
e What is the average age of passengers in the three classes? 
Dataset 


Clean Titanic Dataset 
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